Mon. Not. R. Astron. Soc. 000, [j]-|l3] (2000) Printed 1 February 2008 (MN M^jX style file vl.4) 



The 1-point PDF of the Initial Conditions of our Local 
Universe from the IRAS PSC redshift catalogue 



P. Monaco 1 ' 2 , G. Efstathiou 2 , S. J. Maddox 2 , E. Branchini 3 , 

C. S. Frenk 4 , R. G. McMahon 2 , S. J. Oliver 5 , M. Rowan-Robinson 5 , 

W. Saunders 6 , W. J. Sutherland 7 , H. Tadros 7 , S. D. M. White 8 

1 Dipartimento di Astronomia, Universita di Trieste, via Tiepolo 11, 34-131 Trieste, Italy 

2 Institute of Astronomy, Madingley Road, Cambridge CB3 OH A, UK 

3 Kapteyn Sterrewacht, Rijksuniversiteit Groningen, Postbus 800, 9700, AV Groningen, The Netherlands 
4 Department of Physics, University of Durham, DH1 3LE, UK 

5 Blackett Laboratory, Imperial College, Prince Consort Road, London SW7 2BZ, UK 

6 Institute for Astronomy, Blackford Hill, Edinburgh EH9 3RJ, UK 

7 Nuclear and Astrophysics Laboratory, Keble Road, Oxford OX1 3RH, UK 

8 MPI-Astrophysik,Karl-Schwarzschild-Strasse 1, Garching bei Munchen, Germany D-85740 



Received 2000 



ABSTRACT 

The algorithm ZTRACE of Monaco & Efstathiou (1999) is applied to the IRAS 
PSCz catalogue to reconstruct the initial conditions of our local Universe with a reso- 
lution down to ^5 h^ 1 Mpc. The 1-point PDF of the reconstructed initial conditions 
is consistent with the assumptions that (i) IRAS galaxies trace mass on scales of ^5 
hr 1 Mpc, and (ii) the statistics of primordial density fluctuations is Gaussian. We 
use simulated PSCz catalogues, constructed from N-body simulations with Gaussian 
initial conditions, to show that local non- linear bias can cause the recovered initial 
PDF (assuming no bias) to be non-Gaussian. However, for plausible bias models, the 
distortions of the recovered PDF would be difficult to detect using the volume finely 
sampled by the PSCz catalogue. So, for Gaussian initial conditions, a range of bias 
models remain compatible with our PSCz reconstruction results. 

Key words: Cosmology: theory - large-scale structure of the Universe - galaxies: 
distances and redshifts, clustering 



1 INTRODUCTION 

In the gravitational instability scenario, the large-scale 
structure of the Universe is formed by the growth of small 
fluctuations superimposed on a homogeneous background. 
In most inflationary models, these fluctuations are usu- 
ally predicted to follow Gaussian statistics (see, e.g., Linde, 
1990). Non-Gaussian initial conditions are however pre- 
dicted by models based on topological defects (see, e.g., 
Brandenberger 1998), as well as by some inflationary models 
(see, e.g., Salopek 1999). At the present time, perturbations 
at galactic or smaller scales are highly non-linear, while the 
large-scale distribution of galaxies is still in the linear or 
mildly non-linear regime. For these fluctuations, the map- 
ping of the mass distribution from the final to the initial 
configuration is single- valued. The gravitational evolution 
of the matter field can be inverted, even though at rela- 
tively small scales the evolution is beyond the linear stage 



and strong non-Gaussian features in the density field are 
already present. 

Under the assumption that galaxies trace the large-scale 
distribution of mass in a specified way, it is possible to re- 
cover the initial perturbation field which has given rise to 
the observed galaxy distribution. It is interesting to try to 
recover the initial conditions of our local Universe, not sim- 
ply for understanding our local cosmography, but for other 
reasons as well. The inversion itself does not require any 
assumption concerning the detailed statistical properties of 
the density field. A reconstruction of initial conditions can 
therefore be used to test the assumption of Gaussianity. Fur- 
thermore, it can be used in N-body simulations to mimic the 
growth of clustering in our local Universe. This type of sim- 
ulations have many uses, e.g. to produce predictions of our 
local peculiar velocity field, to quantify Malmquist biases in 
peculiar velocity measurements etc. (see, e.g., Kolatt et al, 
1996; Narayanan et al. 1999). 
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The reconstruction of initial conditions is hampered by 
the well-known facts that galaxies are biased tracers of the 
matter density, and that the details of this bias are poorly 
known (see, e.g., Tegmark & Peebles 1998; Catelan et al. 
1998; Lahav & Dekel 1999; Pearce et al. 1999; Benson et al. 
1999; Somerville et al. 1999; Seljak 2000; Sigad, Branchini 
& Dekel 2000). A model-independent inclusion of bias in a 
reconstruction algorithm is not possible at present. Besides, 
for initial Gaussian perturbations the non-Gaussian statis- 
tics of the matter density field is determined by gravitational 
evolution. Galaxy bias influences the statistics of the galaxy 
density field. A reconstruction procedure applied to a biased 
galaxy density field but performed assuming no bias can 
correct only for gravitationally induced non-Gaussianities, 
thus letting bias induce non-Gaussian features in the re- 
covered initial conditions. In other words, there is a de- 
generacy between initial non-Gaussianity and the effects of 
galaxy bias. The Gaussianity of initial conditions is much 
better tested on very large scales by the cosmic microwave 
background (CMB). A careful analysis of the COBE data 
has lead to a detection of small non-Gaussianity (Ferreira, 
Magueijo & Gorski 1998), which is likely to be an instru- 
mental effect (Bromley & Tegmark 1999; Banday, Zaroubi 
& Gorski 1999; Contaldi et al. 1999). Future measurements 
of Boomerang, MAP and PLANCK will either detect non- 
Gaussianity or strengthen the limits on it. Such possible 
non-Gaussian features are anyway small. With plausible ex- 
trapolation, they are likely to remain considerably smaller, 
at the ~10 /i" 1 Mpc scale, than the signal that galaxy bias 
can induce in a reconstruction scheme as the one presented 
here (see Verde et al. 1999). As a consequence, the statistics 
of the reconstructed initial conditions can be used to give 
new valuable constraints on galaxy bias models, once the 
Gaussianity of initial conditions is either assumed or tightly 
constrained by the future CMB measurements. 

The mildly non-linear evolution of the matter field is 
well modeled by the Zel'dovich (1970) approximation. It 
describes the evolution of the growing mode of perturba- 
tions into the non-linear regime, as long as the flow is 
still laminar. This restriction to the growing-mode dynam- 
ics gives an advantage over a backward N-body simulation, 
in which noise is amplified as a decaying mode. An inver- 
sion of the Zel'dovich mapping was proposed by Nusser & 
Dekel (1992). Their 'time machine' code requires the ve- 
locity potential field in real-space as an input. When ap- 
plied to a galaxy catalogue, the initial conditions are not 
self-consistently obtained from the observed quantity, i.e. 
the galaxy density field in redshift space. Other methods 
for reconstructing the initial conditions, related to that of 
Nusser & Dekel (1992), were proposed by Gramann (1993) 
and Taylor & Rowan- Robinson (1993). Croft & Gatzanaga 
(1997) developed PIZA, a method based on the least-action 
principle (Peebles 1989) and the Zel'dovich approximation. 
The same least-action principle has been exploited to con- 
struct efficient reconstruction algorithms by Nusser & Bran- 
chini (1999) and by Golberg and Spergel (1999). Weinberg 
(1992) and Narayanan & Weinberg (1998) imposed Gaus- 
sianity of the recovered initial conditions to improve the per- 
formance of a reconstruction algorithm applied to evolved 
Gaussian fields. Narayanan & Croft (1999) compared the 
performance of different reconstruction algorithms to N- 
body simulations. According to them, the most accurate 



reconstruction schemes are that of Narayanan & Weinberg 

(1998) and PIZA. 

The correctness of the reconstruction of the initial con- 
ditions relies on the accuracy of the input galaxy sample, 
and on the extent of the sampled volume. In this regard, 
the uniformity of the galaxy selection and the sky cover- 
age are critical aspects. For this reason, the galaxy samples 
used to construct the galaxy density field have been selected 
from the objects contained in the all-sky point source cat- 
alogue (PSC) of the IRAS satellite, because they are ob- 
served on the whole sky by the same instrument, and IR- 
selected samples are less affected by dust extinction due to 
the Galactic Plane (although good all-sky optical catalogues 
as the ORS (Santiago et al. 1995) or the NOG (Marinoni et 
al. 1999) are now available). The far-IR selection is biased 
against spheroidal objects, so that IRAS galaxies tend to 
miss rich structures (even though at scales larger than ~ 5 
h^ 1 Mpc this effect is small, see Baker et al. 1999). The 
QDOT redshift catalogue (Lawrence et al. 1999) was de- 
fined by sparsely sampling the PSC galactic objects with 
60/i flux larger than 0.6 Jy (the completeness limit). Fisher 
et al. (1995) defined their sample by limiting the PSC to a 
60^i flux of 1.2 Jy. More recently, a redshift catalogue of all 
the PSC galaxies (hereafter PSCz; Saunders et al. 2000) has 
been completed. This catalogue provides the best presently 
available sample to quantify the galaxy density field of our 
local Universe. 

To reconstruct the initial conditions of our local Uni- 
verse, Nusser, Dekel & Yahil (1995) used the real-space den- 
sity field of Yahil et al. (1991), obtained from the IRAS 1.2 
Jy redshift catalogue, smoothed with a Gaussian filter of 
width 10 h^ 1 Mpc, using an iterative technique based on 
linear theory. The peculiar velocity potential, obtained from 
the real-space density field by means of an average non-linear 
velocity-density relation, was given as an input to the Nusser 
& Dekel's (1992) time machine. The 1-point probability dis- 
tribution function (hereafter PDF) of their recovered initial 
conditions was consistent with a Gaussian. A reconstruction 
of the initial conditions was performed also by Kolatt et al. 
(1996), using the 1.2 Jy catalogue, and Narayanan et al. 

(1999) , using the PSCz catalogue. In both papers the Gaus- 
sianity of the initial conditions was not found from data but 
forced, following the Gaussianization technique of Weinberg 
(1992) or Narayanan & Weinberg (1999). The recovered ini- 
tial conditions were used to run N-body simulations to con- 
struct mock peculiar velocity catalogues (Kolatt et al. 1996) 
or to study the effect of biased galaxy formation in the re- 
construction (Narayanan et al. 1999). 

Monaco & Efstathiou (1999, hereafter ME99) have re- 
cently described an algorithm (ZTRACE) for recovering the 
real-space density field, the peculiar velocities and the ini- 
tial conditions from a redshift catalogue with known selec- 
tion function. ZTRACE is based on a self-consistent solution 
of the Zel'dovich approximation: it finds, with an iterative 
scheme, the set of initial conditions which evolve into the 
observed redshift-space density field, under the assumption 
that galaxies trace mass. The algorithm has been tested us- 
ing N-body simulations. When the density field is smoothed 
with a mass-preserving adaptive scheme, the initial condi- 
tions and their PDF are reconstructed in an unbiased way, 
with the exception of the high peaks whose density contrast, 
linearly extrapolated to the present time, is J> 1. These have 
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already gone into the highly non-linear regime. As shown by 
ME99, the quality of the reconstruction is almost indepen- 
dent of the assumed background cosmology. 

In this paper we apply the ZTRACE algorithm to the 
PSCz catalogue. In this way we recover the initial conditions 
of our local Universe on scales as small as ~5 ft -1 Mpc, 
thus pushing the reconstruction to the limit beyond which 
high non-linearity prevents any recovery. Here we focus on 
the reconstructed PDF of the initial conditions, testing its 
Gaussian shape; future work will address the use of the re- 
constructed initial conditions to simulate our local Universe. 
Section 2 describes the PSCz catalogue and the construction 
of the redshift-space density field. Section 3 gives a brief de- 
scription of ZTRACE, and describes the results of its appli- 
cation to the PSCz catalogue. In Section 4 we analyze some 
sources of bias in the reconstructed initial PDF, namely the 
high-density tail, the effect of galaxy bias and the errors in 
the selection function. Section 5 describes the reconstructed 
initial PDF of the PSCz catalogue, and Section 6 gives the 
conclusions. 

Throughout this paper distances are given in h~ Mpc, 
with h = Hq /(WO km/s/Mpc). 




Figure 1. The PSCz catalogue in galactic coordinates. The re- 
gions uncovered by the IRAS PSC are masked in grey. 



2 THE PSCZ CATALOGUE 

For this analysis we use the recently completed PSCz cata- 
logue, which is described in detail in Saunders et al. (2000). 
The catalogue is based on the IRAS PSC (Joint IRAS Sci- 
ence Working Group 1988), and comprises all galaxies with 
60/i fluxes > 0.6 Jy. As the PSC is not complete to 0.6 
Jy in 2HCON areas (Beichman et al. 1988), the 1HCON 
detections recovered from the PSC Reject file were added 
to the catalogue. Stars, other galactic sources and AGNs 
were eliminated by means of colour criteria, while residual 
contamination was eliminated by a combination of optical 
identifications, examination of the raw IRAS addscan pro- 
files, examination of the Simbad database and, where still 
unclear, spectroscopic analysis. Redshifts for ~ 4600 galax- 
ies not belonging to the QDOT (Lawrence et al. 1999) or the 
1.2 Jy (Fisher et al. 1995) surveys, and not available in liter- 
ature, were observed with several telescopes. Faint galaxies, 
with Bj > 19.5, were not observed; most of these objects 
are galaxies with z > 0.1, and their omission is irrelevant 
for the present analysis. 

The final catalogue is 98 percent complete in redshift 
over 84 percent of the sky. The missing zones are the Galac- 
tic Plane (where the extinction is Ay > 1.42 mag), the areas 
covered by the LMC and SMC, and two small strips not ob- 
served by IRAS. Fig. 1 shows the Aitoff projected catalogue, 
with the uncovered zones masked in grey. The PSCz cata- 
logue has already been used in other works on the topology 
of the large-scale structure (Canavezes et al. 1998), on the 
cosmological dipole (Rowan- Robinson et al. 1999; Schmoldt 
et al. 1999a), on the beta parameter (Schmoldt et al. 1999b), 
on the power spectrum of fluctuations (Sutherland et al. 
1999), on the redshift distortions (Tadros et al. 1999) and 
on the peculiar velocity field (Branchini et al. 1999). Com- 
parisons with other galaxy and galaxy cluster catalogues 
were presented by Seaborne et al. (1999) and Plionis et al. 
(1999). Reconstruction analyses were presented by Sharpe 



et al. (2000), based on the least-action principle, and by the 
already mentioned paper by Narayanan et al. (1999). 

To calculate the galaxy density field from the PSCz cat- 
alogue it is necessary to know the selection function, which 
gives the number density of objects bright enough to be 
included in the catalogue, as a function of distance. The 
selection function is well fitted by the following functional 
form: 



Mr) 



(r/r,) 1 - 



(1 + (r/r,)7)/5/V 



(1) 



where r = cz/Hq is the redshift distance, but the volume 
is corrected for relativistic effects (assuming an Einstein-de 
Sitter Universe). As ZTRACE applies in a frame comoving 
with the background, we use redshifts in the CMB frame. 
A maximum likehood fit, performed with the method of 
Mann, Saunders & Taylor (1996) and used in Saunders et 
al. (2000), gives the following best values for the parame- 
ters: = 0.0224 h 3 Mpc~ 3 , r* = 64.10 h" 1 Mpc, a = 1.41, 
(3 = 4.32 and 7 = 1.39. The resulting selection function is 
only marginally different from that calculated in the Local 
Group frame. The effects of uncertainties in the selection 
function will be addressed in Section 4.3. 



3 APPLICATION OF ZTRACE TO THE PSCZ 
CATALOGUE 

ZTRACE is based on a self-consistent solution of the 
Zel'dovich (1970) approximation. In the Lagrangian picture 
of fluid dynamics a mass element with initial (Lagrangian) 
comoving position q is displaced to the final (Eulerian) po- 
sition x through a displacement field S: 



x(q,t) =q + S(q,t). 



(2) 



The map S is the dynamical variable in this approach. For 
small displacements, and for irrotational fluids, the first- 
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Figure 2. (a) Density field from the IRAS PSCz catalogue, adaptively smoothed over 7.6 ft, -1 Mpc within 80 h' 1 Mpc. The Super- 
Galactic Plane is shown. The peaks correspond to well known superclusters; see text for details, (b) Initial density field, linearly 
extrapolated to the present time. 
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order growing mode gives the Zel'dovich (1970) approxima- 
tion: 

x(qt) =q-£>(t)Vy>(q). (3) 

Here D(t) is the linear growing mode (see, e.g., Peebles 
1980); for an Einstein-de Sitter universe it is equal to the 
scale factor a(t). The growing mode is normalized so that 
D(to) ~ 1, where to is the present time. <£>(q) is a rescaled 
version of the peculiar gravitational potential, such that, at 
an initial time U, V 2 tp — S(q,U)/D(ti). The density con- 
trast 5 is defined as usual, <5(x) = (f?(x) — Q)/q, where g is 
the background density. It is useful to define the quantity: 

fl(q) = d(q,tO/D(ti). (4) 

Si is constant in linear theory, and is equal to the initial 
density contrast linearly rescaled to the present time. This 
quantity will be called the linear density contrast in the 
following. 

In a redshift survey, the radial coordinate is the redshift 
distance, not the real distance. It is possible to generalize the 
Zel'dovich map, Eq. ^, to the redshift space s: 

s(q, t) = q - D(t) [V^(q) + f{Q) (V^(q) ■ x) x] . (5) 

Here x is the versor of x, and f(0.) = d In D/d In a is usu- 
ally approximated by f(Q) ~ Q ' 6 (Peebles 1980), or by 
/(n,n A ) ~ n - 6 + fi A (l + n/2)/70 (Lahavetal. 1991) if the 
cosmological constant is non-zero. Notice that the last term 
in the right-hand-side of Eq. ^ is the peculiar velocity along 
the line of sight. 

The density contrast follows from the map in Eq. |§] or 
Eq. ^ through: 

l + <5(q) = [det((5f t + 5 , i ,)]- 1 . (6) 

Here 5^ b is the Kronecker tensor, and S a ,b is the derivative 
of the map S. ZTRACE is designed to invert the non- linear 
and non-local set of equations which connect the initial con- 
ditions to the redshift-space density, under the hypothesis of 
laminar flow, i.e. when the Zel'dovich map in redshift space, 
Eq. |E[ is still single- valued. Multi-valued regions appear as 
soon as different mass elements which lie along the same 
line of sight are observed at the same redshift. This hap- 
pens when the perturbation decouples from the Hubble flow, 
giving rise to multi-stream regions in the distance-redshift 
relation (often referred to as "triple- valued regions"). The 
linear density contrast has a value of order one in such re- 
gions; ZTRACE is not able to reconstruct the high peaks of 
the density distribution. 

ME99 describe in detail those techniques used to help 
the convergence of ZTRACE for smoothed density fields 
with standard deviations a up to 0.7^]. The application of 
ZTRACE to the PSCz redshift catalogue can be summarized 
as follows. 

(i) The masked regions are filled with a synthetic cata- 
logue, so as to produce the galaxy density field over a large 
and connected spherical volume. We have used the proce- 
dure developed by Yahil et al. (1991) as modified by Bran- 
chini et al. (1999). According to this procedure, the zone of 

* The tests performed by ME99 are limited to a < 0.7. Anyway, 
for larger a- values the distortion on the positive tail of the initial 
PDF becomes very strong. 



avoidance of the Galactic disk, with galactic latitude |6| < 8 
degrees, is divided in longitude and redshift bins. The galaxy 
number density in each bin is obtained by interpolating be- 
tween those of the two 8 degree stripes above and below 
the bin. A synthetic galaxy catalogue is then obtained by 
randomly sampling the density field within the bins. In this 
way, the radial distribution follows the same PSCz selection 
function. The other masked regions of the sky which are out- 
side the zone of avoidance are filled at random, according to 
the PSCz selection function. This algorithm is considered a 
better guess with respect to filling at random the masked re- 
gions, as it tends not to break the coherent structures, such 
as the Perseus-Pisces supercluster, which happen to cross 
the Galactic Plane. As a further test, we have also applied 
the lognormal interpolation procedure described in Saun- 
ders et al. (1999); the results are very similar and are not 
described here. The reconstruction method is robust with 
respect to the interpolation algorithm because only a small 
fraction of the sky is masked. 

(ii) The relaxed groups are collapsed into spheroids. The 
effects of the highly non-linear dynamics are apparent in 
the redshift space as elongated structures along the line of 
sight, often referred to as 'fingers of God'. As pointed out by 
Gramann, Cen & Gott (1994), a reconstruction scheme im- 
proves if such non-linear structures are collapsed before ap- 
plication of the algorithm. In ME99 the collapsing procedure 
was found to give a modest increase in the performance of 
ZTRACE. Following ME99, we have found the groups in the 
PSCz catalogue by applying a standard friends-of-friends al- 
gorithm, with radial and tangential linking lengths of 3 and 
0.5 h' 1 Mpc. 

(iii) The catalogue is smoothed to generate a density 
field in redshift space. The performance of ZTRACE im- 
proves considerably if the density field is obtained with an 
adaptive, mass-preserving filtering of the galaxy catalogue. 
This is similar to a smoothing in the Lagrangian space, and 
correctly takes into account that overdensities come from 
larger patches which have contracted, while the opposite 
is true for underdensities. Following ME99, we have con- 
structed the redshift-space galaxy density field by smoothing 
the PSCz point distribution (with the addition of the syn- 
thetic catalogue to fill the masked regions) with the adaptive 
smoothing code of Springel et al. (1998). This code requires 
the definition of a reference smoothing radius, which cor- 
responds to the case of null density contrast; the adaptive 
refinements are then performed so that the smoothing filter 
always contains the same number of objects. As in ME99, 
the reference smoothing radius R is held constant within a 
given distance D, to a value such that 10 galaxies are on 
average contained in the filter at that distance. At larger- 
distances, the reference smoothing radius is increased ac- 
cording to the selection function, so that 10 galaxies are 
contained on average within the filter. In this way, the outer 
parts that are more and more severely smoothed are used 
by ZTRACE to give the external tides to the inner region. 

The density field is calculated on a 64 3 grid of size 
240 Mpc. Fig. 2a shows the obtained density field in 
Super-Galactic coordinates, in the case R=7.6 h' 1 Mpc 
within D=80 h^ 1 Mpc; the severe smoothing of the outer 
regions is apparent. The high peaks in the density distribu- 
tion correspond to well-known structures as the Local Su- 
percluster [at the position (x,y) ~ (0,0)], the Great At- 
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(a) R = 5.1 h" 1 Mpc, D = 50 tr 1 Mpc, cr = 0.40155 



(b) R=7.6 h" 1 Mpc, D = 80 h" 1 Mpc, cr = 0. 29352 
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Figure 3. Reconstructed initial PDFs, averaged over 10 simulated catalogues. The errorbars shown are the square root of the variance 
of the mean among the realizations. The true Gaussian shape is also shown, (a) Smoothing R = 5.1 h~ 1 Mpc within D = 50 h~ 1 Mpc. 
(b) Smoothing R = 7.6 Mpc within D = 80 /i -1 Mpc. The lower panels show the significance of the difference between the simulated 
and true initial PDFs. The lines at ±1<t, ±2<r and ±3<r are shown for clarity. 



tractor region [the two peaks at (x,y) ~ (—40,-20) and 
(—40,20)], the Perseus-Pisces supercluster [the two peaks 
at (x, y) ~ (20, —50) and (40, —20)] and its appendix be- 
yond the Galactic Plane [the Camelopardalis superclus- 
ter, at (x,y) ~ (60,10)], and the Coma supercluster [at 
(x,y) ~ (0,80)] (see, e.g., Branchini et al. 1999). 

(iv) The input density field is padded in the volume out- 
side the largest inscribed sphere (of radius 120 ft -1 Mpc), 
so as to have periodic boundary conditions. This is done so 
that we can apply FFT's to solve the equations. ZTRACE 
is applied to the field; convergence is achieved within ~12 
iterations. Fig. 2b shows the resulting initial conditions in 
the comoving real space (in terms of the linear density con- 
trast Si, Eq. ^J). (Notice that the linear density contrast is 
not bound to be > —1). The main superclusters are origi- 
nated from high peaks in the initial conditions. These peaks 
appear flattened because of the inability, discussed above, 
of ZTRACE to reconstruct the high-density regions. The 
different statistical properties of the initial conditions, with 
respect to the density field in real and redshift space, are 
apparent. 

(v) The PDF of the initial conditions is calculated. In 
this analysis only the grid points within the distance D are 
considered. The points in the zone of avoidance Qb\ < 8°) 
are excluded, so as the central 10 h~ Mpc sphere. This 
last exclusion is motivated by the inability of ZTRACE to 
converge in the neighborhood of the observer; the numerical 
solution tends to oscillate at the origin, where the map given 
in Eq. |^ is singular. To avoid this the density field is damped 
over ~5 h~ x Mpc. The exclusion of the central region is not 



a problem as its volume is small, and the PSCz is known to 
be incomplete at such small distances (Rowan- Robinson et 
al. 1991). 



4 ERRORS AND BIASES IN THE 

RECONSTRUCTION OF THE INITIAL PDF. 

The recovery of the initial PDF can be influenced by high 
non-linearities, by galaxy bias and by errors in the selection 
function. As shown in ME99, a wrong assumption on the 
background cosmology does not influence significantly the 
shape of the initial PDF. 

4.1 The high-density tail 

As mentioned above, ZTRACE is not able to recover the 
high peaks, which have decoupled from the Hubble flow. 
This induces a bias in the initial PDF, whose high density 
tail is suppressed. The underestimated high-density points 
influence the initial PDF also at moderate densities. It is 
useful to quantify this effect before addressing the Gaus- 
sian nature of the initial PDF reconstructed from the PSCz 
catalogue. To this aim, we have extracted mock PSCz cata- 
logues from an N-body simulation of a standard CDM Uni- 
verse (with f2 = 1 and F = Qh = 0.5) with normalization 
erg = 0.7. The simulation, performed with the Hydra code 
(Couchman, Thomas & Pearce 1995), has already been pre- 
sented in ME99. It is performed with 128 3 dark matter par- 
ticles in a box of length 240 h -1 Mpc with a 256 3 base mesh 
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with adaptive refinements. As the volume used in the recon- 
struction is much smaller than the total volume of the sim- 
ulation, it is convenient to extract more than one simulated 
PSCz catalogue from the simulation. We have extracted 10 
catalogues, centred on random points, under the assump- 
tions that galaxies trace mass and applying the PSCz selec- 
tion function (in real space). It has been checked in ME99 
that the selection in real space with a selection function 
determined in redshift space does not influence the result. 
The smoothing radius has been set to R = 5.1 hT 1 Mpc 
(constant within D — 50 h' 1 Mpc) or R = 7.6 h^ 1 Mpc 
(constant within D = 80 /t -1 Mpc). When the smoothing is 
held constant within D — 50 hT 1 Mpc, the volume actually 
used in the reconstruction is 3.8 percent of the total volume 
of the simulation, so that the different realizations are accu- 
rately statistically independent. When the smoothing is held 
constant within D — 80 h~ x Mpc, the volume used is 15.5 
percent, and then the 10 realizations are oversampling the 
volume. This is approximately taken into account by calcu- 
lating the variance among the various realizations as if they 
were 1/0.155 ~ 6.45 instead of 10. 

For each simulated catalogue, the galaxies in the PSCz 
masked area are removed, then replaced using the same al- 
gorithm of Branchini et al. (1999) discussed above. We have 
checked that the masking procedure introduces noise but no 
bias in the reconstruction. The reconstruction proceeds as 
described in Section 3. 

Because of sample variance, the reconstructed initial 
PDFs differ in general in mean ({Si)) and standard devi- 
ation (ai). To construct an average initial PDF it is then 
useful to consider the PDF of the quantity (8i — (&i))/<ji. All 
the distributions are binned in the same way, and averages 
and variances are calculated for each bin. Fig. 3 shows the 
reconstructed initial PDFs, compared with the true Gaus- 
sian curve, for the two smoothing radii used. The upper 
panels show the reconstructed initial PDFs, the errors are 
the square root of the variance of the mean for the ten cat- 
alogues (calculated as if they were 6.45 in the second case, 
see above). The lower panels show the significance of the 
difference between the reconstructed and true initial PDF. 
It is apparent that the true Gaussian shape is recovered ac- 
curately, except in the high-density tail. 

4.2 The effect of galaxy bias. 

The simulated galaxy catalogues are created and recon- 
structed assuming that galaxies trace mass. This may not 
be true in practice, as galaxies may be biased tracers of the 
underlying mass distribution. As mentioned in the Introduc- 
tion, galaxy bias may induce non-Gaussian distortions in the 
initial PDF, when this is reconstructed simply assuming that 
galaxies trace mass. To quantify this effect, it is convenient 
to apply ZTRACE, under the assumption that galaxies trace 
mass, to biased galaxy catalogues that are simulated assum- 
ing some specific bias model. 

We have used a (deterministic, i.e. non-stochastic) 
power- law local bias scheme, already suggested, e.g., by 
Nusser et al. (1995) and by Lahav & Dekel (1999): 

l + 5 s (x) = (l + & )(l + 5(x)) 61 . (7) 

Here 8 g is the galaxy density contrast. This scheme respects 
the condition 8 g > — 1. For small density values this scheme 
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Figure 4. Bias functions used in the paper. Dashed line: anti- 
biased power-law with b\ = 0.5. Continuous line: biased power- 
law with bi = 1.5. Dot-Dashed line: semi-analytical bias. 

reduces to the linear bias, S g = (1 + bo)bi8. bo is constrained 
so that (S g ) = 0. In the following we will consider the anti- 
biased case b\ = 0.5 and bo = 0.084, and the biased case 
b\ — 1.5, and 60 = —0.137. Fig. 4 shows the two bias curves. 

A more physically motivated bias scheme can be ob- 
tained by using semi-analytic galaxy formation models (see, 
e.g., Benson et al. 1999; Somerville et al. 1999). In these 
models, the assembly of dark matter halos is described by 
means of N-body simulations, while the physics of the bary- 
onic component is inserted in the halos through a set of 
simple analytic rules. Galaxy bias can then be obtained by 
comparing the density of galaxies with the density of dark 
matter. Narayanan et al. (1999) give an approximate bias 
function for the IRAS-like galaxies of Benson et al. (1999): 

1 + 5 s (x) = A(l + <Kx)) Q [C + (l + <5(x))< Q -^] " 7 . (8) 

Here a = 2.9 and /3 = 0.825 are the asymptotic logarithmic 
slopes at small and large densities, while A = 1.1, C = 0.08 
and 7 = 0.4. This bias function is shown in Fig. 4. Compared 
to the simple power-law curves, it behaves as in the biased 
case in underdensities, but it is qualitatively more similar to 
the unbiased case in overdensities. We use the semi-analytic 
bias scheme as an exercise to quantify the distortion of the 
initial PDF expected from a more sophisticated bias func- 
tion. It must be noticed, however, that this bias function is 
relative to a flat Universe with Qa = 0.7 and as = 0.9, and 
to a density field smoothed with a top-hat filter with radius 
3 h^ 1 Mpc, while we are applying it to an Einstein-de Sitter 
universe and to a density field Gaussian smoothed over 5 
h- 1 Mpc. 

Following the same procedure as in Section 4.1, we 
have simulated three sets of 10 catalogues, with the three 
bias functions discussed above and shown in Fig. 4. The 
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Figure 5. As in Fig. 3: reconstructed initial PDFs, averaged over 10 simulated catalogues, for biased catalogues, with b\ = 0.5 (a,d), 
bi = 1.5 (b,e) or semi-analytic bias (c,f). The smoothing radius is set to R = 5.1 h~ 1 Mpc within D = 50 ft -1 Mpc (a,b,c) or R = 7.6 
/i.- 1 Mpc within D = 80 /i" 1 Mpc (d,e,f). 



bias schemes are applied to the real-space density field of 
the simulation, smoothed with a Gaussian filter of width 5 

Mpc. Fig. 5. shows the obtained average initial PDFs; 
they are to be compared to those shown in Fig. 3. The Gaus- 
sian initial PDF is again well reconstructed in the "anti- 
biased" power-law case in which 6i = 0.5, even though the 
low-density tail shows a 2-<r discrepancy when R — 5.1 

Mpc. In the "biased" power-law case, that in which 
bi = 1.5 and especially the semi-analytic case, significant 
distortions are apparent. For 7? = 5.1 h -1 Mpc, the tail at 
low densities is overestimated, a dip and a bump are present 
respectively at Si/a ~ and 1.5. The same features are still 
present in the case R = 7.6, although they are less signifi- 
cant. 

Fig. 5 shows that the application of ZTRACE with the 
no-bias assumption can lead to small but significant dis- 
tortions in the reconstructed initial PDF. However, these 
distortions are revealed by averaging over ten realizations 
of the PSCz catalogue, but they are not statistically signif- 
icant for single biased realizations. The PSCz survey is not 



large enough to reveal such deviations, but larger surveys 
such as the 2dF and SDSS surveys may be able to provide 
constraints on non-linear bias schemes. Of course this con- 
clusion may change if stronger bias functions are considered; 
however, evidence on IRAS galaxies suggests that they are 
weakly biased with respect to the matter density (see, e.g., 
Saunders et al. 1999). 

4.3 Errors in the selection function 

Another potential source of distortion on the initial PDF 
comes from errors in the selection function. Given the highly 
correlated nature of the density field, a systematic error in 
the density estimate in a spherical shell which happens to 
contain some large voids or clusters can influence the shape 
of the initial PDF. To test for this, we have collected the se- 
lection functions (in the Local Group frame) used by ME99, 
Sutherland et al. (1999), Canavezes et al. (1998), Branchini 
et al. (1999), and Springel (1996). We have added to these 
our own determination of the selection function in the Local 
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Figure 6. Residuals of the selection functions taken from ME99 
(long-dashed), Sutherland et al. (1999) (dashed), Branchini et al. 
(1999) (dot-dashed), Canavezes et al. (1999) (dot-long dashed), 
Springel (1996) (short-long dashed), and determined by us in the 
Local Group frame (dotted), with respect to the one calculated 
by us in the CMB frame. 



Group frame, performed as described in Section 2. The dif- 
ferences between the various functions, which are of order 
of 10 percent, are due to the slightly different versions of 
the catalogue and mask used by the various authors, and to 
differences in the fitting procedure. Fig. 6 shows the residu- 
als of these selection functions with respect to the one used 
in this paper (Eq. [l]) . Notice that all the residuals show the 
same systematics because the selection functions are relative 
to the Local Group frame. Then, the effect of errors in the 
selection function is overestimated in this analysis. 

The first simulated PSCz catalogue (with no bias) has 
been reconstructed assuming either the correct or one of 
the six other selection functions listed above. Fig. 7 shows 
the seven reconstructed initial PDFs, compared with the 
average reconstructed initial PDF already shown in Fig. 3. 
Differently from Fig. 3, the errors shown are the (square 
root of the) variance between the various realizations (in- 
stead of the variance of the mean) . This is the correct error 
to use when assessing the difference between a single real- 
ization and the average initial PDF. Again, the lower panels 
show the significance of the disagreement. The various initial 
PDFs are hardly distinguishable, no significant distortion is 
induced. We conclude that the uncertainty in the selection 
function does not influence significantly the shape of the 
reconstructed initial PDF. 



The PSCz catalogue has been adaptively smoothed accord- 
ing to the smoothing strategy described in Section 3. The 
reference radius has been held constant, within D = 50 and 
80 h' 1 Mpc, to the values of R = 5.1 and 7.6 h' 1 Mpc. Fig. 
8 shows the reconstructed initial PDFs of the PSCz cata- 
logue compared with the results of the simulated catalogues 
described in Section 3.1 (with no bias and CMB selection 
function) and shown in Fig. 3. The error associated to the 
simulated initial PDFs is the square root of the variance 
between the realizations. 

The value of erg for the PSCz density field turns out to 
be about 0.8 (Sutherland et al. 1999; Seaborne et al. 1999), 
slightly larger than the value 0.7 used in our test simula- 
tion. As a consequence, the variance of the reconstructed 
initial PDF is slightly larger than the test one smoothed 
with the same radius. This has some effect on the spurious 
high-density cutoff of the PDF induced by ZTRACE. To 
compare reconstructed PDFs that have roughly the same 
variance, the PSCz has been smoothed over R — 7.2 and 
10.1 h' 1 Mpc within D = 75 and 105 h' 1 Mpc. Fig. 9 shows 
the comparison with the same PDFs of simulated catalogues 
as in Fig. 3 and 8. 

The reconstructed initial PDF of the PSCz catalogue 
disagrees significantly with the simulated one in Fig. 8a 
(smoothing radius R = 5.1 h~ x Mpc), where the standard 
deviation a of the two PDFs that are compared are different. 
Even in this case, significant distortions are visible only on 
the positive-density side, for (<5; — (8i))/a <; 1. In this range 
of densities the influence of the high-density cut (which is 
different in the two cases due to the difference in a) is strong. 
This discrepancy vanishes in Fig. 9a, when comparing the 
reconstructed PDFs of simulated and PSCz catalogues with 
similar variances. Besides, the distortions on the negative- 
density side, where we expect to find the signature of galaxy 
bias (see Fig. 4), are never significant, except a 3-cr discrep- 
ancy which is limited to one single bin. Figs. 8b and 9b show 
the same comparison for the larger smoothing radii. In this 
case, no strongly significant distortions are visible, although 
some modest but persistent discrepancy at the 3-cr level is 
present both on the positive- and on the negative-density 
sides. 

We conclude that there is no convincing evidence for 
non-Gaussianity in the initial PDF reconstructed from the 
PSCz catalogue. Notably, the overall pattern of the residuals 
is similar to the distortions induced by biased power-law 
or semi-analytic bias. Anyway, the statistical significance of 
these distortions is too marginal to draw any conclusion. 

The Gaussianity of the reconstructed inital PDF gives 
support to the hypotheses of Gaussianity of initial condi- 
tions and of modest bias of IRAS objects (as described in 
Section 4.2). In particular, the low-density tail of the initial 
PDF remains Gaussian down to fluctuations of ~2.5 times 
the standard deviation. The fact that 'the voids are too de- 
void' of bright galaxies has often been seen as a possible 
problem for CDM-like models of structure formation (see, 
e.g., Peebles 1999). The result presented here shows that 
the voids, as defined by IRAS galaxies, are as underdense as 
they are expected to be as an effect of gravitation, at least 
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Figure 7. Reconstructed initial PDF for the first simulated (unbiased) catalogue, compared to the average simulated initial PDF of Fig. 
3. The density field is reconstructed using either the correct selection function or one of the six alternatives listed in the text. The error 
associated to the simulated initial PDFs is the square root of the variance among the realizations. The lower panels show the significance 
of the disagreement between the given realization and the average. 



on scales as small as 5 h 1 Mpc, where the highly non- linear 
regime is still not dominant. 



6 SUMMARY AND CONCLUSIONS 

The ZTRACE algorithm has been applied to the new IRAS 
PSCz redshift catalogue to reconstruct the initial conditions 
of our local Universe. The ZTRACE algorithm, recently pro- 
posed by ME99, is known to give an unbiased estimate of the 
initial density which generates, through gravitational col- 
lapse in the Zel'dovich (1970) approximation, the observed 
galaxy density field in the redshift space under the assump- 
tion that galaxies trace mass. The reconstructed initial con- 
ditions of our local Universe can be used as input to N- 
body simulations, with the small-scale modes restored with 
a Monte-Carlo procedure (Kolatt et al. 1996; Narayanan et 
al. 1999). 

The results presented in this paper push the reconstruc- 
tion of initial conditions to scales of 5 ft -1 Mpc. This is 
close to the limit at which highly non-linear dynamics be- 
comes dominant, hampering any detailed inversion of the 
gravitational evolution. This paper improves by a factor of 
two in scale over the previous determination of the initial 
PDF, by Nusser et al. (1995). This improvement is due to 
the increased sampling density and depth of the new PSCz 
catalogue relative to previous IRAS-based catalogues such 
as QDOT (Lawrence et al. 1999) or 1.2 Jy Fisher et al. 
(1995), and to the superior performance of ZTRACE, com- 



pared to previous reconstruction algorithms (see ME99 and 
Narayanan & Croft 1999). 

The reconstructed initial PDF of the local Universe is 
Gaussian to within the accuracy of the method. The lack of 
detection of non-Gaussianity in the initial PDF supports the 
hypothesis that initial conditions are Gaussian and IRAS 
galaxies are not strongly biased with respect to the mat- 
ter density. Moreover, it shows that the voids are as devoid 
of bright galaxies as they should be as an effect of gravity 
(see, e.g., Peebles 1999). The Gaussian nature of the pri- 
mordial perturbations has been tested on much larger scales 
and with greater accuracy by measurements of the CMB. 
Then this result, besides offering a complementary support 
for Gaussianity on the scale of ~5-10 Mpc, gives origi- 
nal constraints on the nature of galaxy bias, which in general 
induces distortions on the initial PDF reconstructed under 
the no-bias hypothesis. However, our tests show that the dis- 
tortion induced by realistic power-law galaxy bias models, 
especially on the low-density tail, is just beyond the reach 
of the PSCz catalogue. Future surveys, like the 2dF or the 
SDSS, covering larger volume than PSCz, will provide a way 
to constrain galaxy bias beyond the linear level from the re- 
construction of the initial PDF. 
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Figure 8. Reconstructed initial PDFs for the PSCz catalogue compared with those for the simulated catalogues. The smoothing radii 
are R = 5.1 and 7.6 h~ 1 Mpc within D = 50 and 80 /i -1 Mpc. The dashed Gaussian line is shown for reference. The lower panels show 
the significance of the disagreement. 
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Figure 9. As in Fig. 8, but the PSCz catalogue is smoothed with R = 7.2 and 10.1 h' 1 Mpc within D = 75 and 105 h' 1 Mpc. The 
smoothing radii are chosen so that the variances of the initial PDFs for PSCz and simulated catalogues roughly coincide. 
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